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® Generating efficient code for a computer with dissimilar register spaces. 

_©.A-compiler_generates_compiled^object_code from. source code of a computer program in a manner that 
produces efficient object code for a computer with dissimilar register spaces. The technique comprising the 
steps of 

generating code that references symbolic registers in which the register class is not distinguished, 

making entries in a table denoting the context in which each symbolic register occurs and constructing an 

equivalence tree of symbolic registers for move instructions assigned to a same equivalence class, 

for each equivalence class, forming the logical OR function of register usage information for all symbolic 

registers in the class, and for each symbolic register that appears in more than one register space context, 

generating new symbolic register numbers so that there is one number for each register space, and storing the 

numbers in said table, and 

if a definition point of a symbolic register is encountered and that symbolic register is used in more than one 
J register space context, inserting code in said program to either do the same operation as is done at the 
C definition point in each register space or move a value in the symbolic register from one space to another. The 
improvement achieved is in object code space and time of execution. 
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GENERATING EFFICIENT CODE FOR A COMPUTER WITH DISSIMILAR REGISTER SPACES 

This invention generally reiates to computer ***** sp^s. 
an opting compiler to generate more ^J^^^^i^^ set computers (RISC) 
The invention has specific application to optimizing compilers for reduced insm. 

but could be used with advantage in any compiler. imnrnvP the aualitv of the object 

„,m £e Pardee ft* sonre opsone « .on,, tad one n e » ^ 

-n ssr-'K - *~ - - -~ 

^»r^*!^3S'l™-*»> » pro-d. - tm p,dven,en, in both she dbjee. cod. soooe end «rne 
'° 7f etdToW. o. « i-eosoo to provide a MM. .or 9.0O„*9 e-noien, oode .or . compter 
with dissimilar register spaces. rtK i*rt rnrie from source code of a computer 

sr^rx M or i^sr^cXr sr . — . - — 

achieved is in object code space and time of e xecution understood 

drawing, in ^ ^ ^ stnjcwe 0| . ^ ^ m oom|<hr «lu««lno 

where the present invention fits into this structure; 

Fig 2 is a flow diagram illustrating the logic of the present invention; 
Fig 3 is a detailed flow diagram of the pass 1 logic shown in Fig. 2; 
Fig 4 is a detailed flow diagram of the interlude logic shown in Fig. 2; and 

* ReSin^w^ 

of an" opting compiler. As is typica. of such ' compos ^ P-ess^g.ns JJ£i£i5£ 

45 mus far i's conventional and well understood by those skilled in the design - 

The improvement according to the presen, "J^««™ ^e's Insert^ Z£Z£m 
renamed maximally. Next, the program is analyzed m block 5 and ood^imra 
spaces. This process is illustrated in more dettul m Fig. ^J*^*^ frZnation. This information 

so Is^hen "processed "in block^ 1 b^fo™* 0 ^ 1 *^ ^3rf^^ow ^proppwn"is^TwdTin^WTCk 
pass. 
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Returning now to Fig. 1 of the drawings, when the processing of Fig. 2 is completed, dead code in the 
program is removed in block 6. This is desirable because the steps of Fig. 2, block 13, may have inserted 
some unnecessary (i.e., "dead") code, and a simple way to detect and remove it is to perform the well- 
known optimization of dead code removal at this point. Register assignments are made in block 7 on the 
basis of the code as it exists to that point. Finally, the assembly and final code generation is performed in 
block 8 to complete the process. 

Further elaborating, the technique according to the invention is embodied in the following steps: 

1. First, code is generated that references symbolic registers, in which the register class is not 
distinguished. That is, a symbolic register such as R100 might later become assigned to a GPR, to an FPR 
or to a CR, depending on how it is used. This is generally illustrated at block 2 in Fig. 1. The usual 
optimizations such as code motion and commoning are done on this code, in block 3. The code at this 
point should have distinct symbolic register numbers whenever possible (a concept called the "maximal 
number of names"), so that step (4) below will only insert extra instructions when there is a high probability 
that they are necessary. This is an observation affecting only compile time, not object code quality. 

2. Next, as illustrated in block 1 1 in Fig. 2, a pass is made over the program, noting (in a table) the 
context in which each symbolic register (SR) appears. This is a simple linear pass, ignoring the branching 
structure and the definition-use relation. During this pass, move register (MR) instructions are treated 
specially. The instruction "MR rx,ry" does not denote any particular register space for rx or ry, but it does 
mean that they are in the same spaces. To handle this, when "MR rx.ry" is encountered, we put rx and ry 
in the same equivalence class. Any standard algorithm for defining equivalence classes of the entries in a 
table may be used, for example Algorithm E of Knuth, The Art of Computer Programming, Fundamental 
Algorithms, Vol. 1, section 2.3.3 on page 354. This simple algorithm constructs a tree for each equivalence 
class, by putting a pointer from rx to ry in the table. 

3. Next, the processing of block 12 in Fig. 2 is made by a pass over the table. For each equivalence 
class, the logical OR of the register usage information for all SRs in the class is formed. Then, for each SR 
that appears in more than one register space context, we generate new SR numbers, so that there is one 
for each register space. These new SR numbers are saved in the table. 

4. Next, the second linear pass shown in block 13 in Fig. 2 is made over the program in which "fix 
up" code is inserted. If a definition point of an SR is encountered, and that SR is used (anywhere in the 
programs) in more than one register space context, then code is inserted to either (a) do the same 
operation as is done at the definition point, in each register space, or (b) move the value from one space to 
another. During this second pass, if a "use" point of an SR is encountered, and that SR appears in more 
than one context, then the SR of the use is changed to be the appropriate SR number for the context of the 
use. 

5. Lastly, the standard optimization of "dead code elimination" is done as shown in block 6 of Fig. 1 , 
and then register allocation* as -shown in block 7 in. Fig. 1. Register allocation is run separately on each 
register space. 

Steps 2 to 4 above form this invention, whereas steps 1 and 5 are a common part of an optimizing 
compiler. The "fix up" code could be inserted either at definition points or at use points. Inserting it at 
definition points (as described above) is usually best, because often one definition point feeds several uses, 
and because sometimes the definition is a simple one. such as setting a variable equal to a constant, that 
can be done in any of the register spaces (avoiding a "move" from one space to another). 

The logic of pass 1. represented by block 11 in Fig. 2. is shown in more detail in Fig. 3. to which 
reference is now made. The process begins by retrieving the first or next instruction in function block 15. A 
test is made in decision block 16 to determine if the instruction is fake. If it is, the process jumps to 
decision block 25; otherwise, a test is made in decision block 17 to determine if the instruction is a move 
register (MR) instruction. If it is, the source and target registers are put in the same equivalence class by 
making the higher numbered one point to the lower numbered one in function block 18. The process then 
jumps to decision block 25. If, on the other hand, the test in decision block 17 is negative, a test is made in 
decision block 19 to determine if the instruction is a conditional branch instruction. If it is. the "CR context" 
for the tested register is noted in function block 20. and then the process jumps to decision block 25. If the 
instruction is not a conditional branch instruction, a test is made in decision block 21 to determine if the 
instruction is a compare instruction. If so, the "CR context" for the target register is noted in function block 
22 In either case, the "FPR context" is noted for all FPRs referenced in function block 23. Next, the "GPR 
context" is noted for all GPRs referenced in function block 24. Finally, in decision block 25 a test is made to 
determine if there are any more operations to be processed. If there are, the process loops back to function 
block 1 5- otherwise, control passes to the interlude logic, shown in Fig. 4. 

In Fig. 4, the interlude process begins by initializing in function block 27; that is, i is set equal to the 
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22 the of me test in decision block 37 is negative, at which point centre, passes to the log.c 
° f P £££Tl£ pa 9 ss 5 2 begins by retrieving the first or next instruction in function block 42. A test is 

££ET?lt is ^ LS b ock 45 code is inserted for each register space in which the target appears 

Ston btock 50 where a test is made to determine if any source register appears ,n more than one 
conS I * the register name is changed to the new name that is in the appropriate register space .for the 
ooerSon inunction block 51 . in either case, control next goes to decision block 52 where . test . mad to 
*E£L if M are any more operations to be processed. H so. the process loops back to funct,on block 

^ tZ^ZeT^T^o^ons, assume that the computer has multiple genera, ^sters and 
multole condS registers. Assume for simplicity that the condition registers are the same length <e£, 32 
KSl registers. This invention is applicable to the case in which the reg.sters are d drfferent 
£££ • M deteils are more complicated, but the principles are the same. Then some typical ^nsfojma- 
tion shown below. Here. "rx" in the left column denotes the original symbolic reg.ster. We assume that 
"^ appears in both contexts (genera, register and condition register) somewhere m *^* m J"J™ 
riohtStamn "crx" denotes the condition register counterpart of "rx". That is. when reg.sters are ass.gr.ed 
tol code in the nght column. "rx" will be assigned to a GPR. and CRX will be assigned to a condrt.cn 
register. 
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(1) shows a fixed point load instruction. Since rx appears somewhere in both contexts (we a e 
assum no) we generate a new symbolic register crx and insert a "move to condition reg.ster" mstructon to 
™ve Txt c^MTCR is . slow! undesirable instruction, but we are assuming that there .s no way to load 

725 ^edr — S We assume the computer has a fast Load Immediate into a 
CR (LICR), so we use that to set crx, rather than an MTCR. 
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(3) shows a Move Register instruction. If either rx or ry appears in both contexts, then they both will 
appear in both contexts, by the equivalence class algorithm described above. Hence, both rx and ry have 
CR counterparts crx and cry. We insert an instruction to move cry to crx. 

(4) shows a Compare instruction. We assume the target can only be a CR. Hence, we change rx to 
the CR counterpart crx. 

(5) shows a Branch on Condition. We assume the tested register rx can only be a CR. so we change 
it to the CR counterpart crx. 

Assume a program contains the source code shown in column (1) below, written in a high level 
language such as PL/1. Variable X is a logical variable (a single bit). We do not show any flow of control; the 
lines can be anywhere in the program. Some may be in loops and they are not necessarily in the order 
shown A compiler that does not incorporate the present invention would generate code similar to that 
shown in column (2). It contains an MTCR (move to condition register) instruction which we assume is 
costly to execute. A compiler that does incorporate the present invention would generate the code similar to 
that shown in column (3). This code is faster because it employs LICR (load immediate CR) instructions, 
rather than an MTCR instruction. 
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(3) 


X = 0; 


LI 


R1.0 


LI 

LICR 


R1,0 
CR1.0 


X = 1; 


LI 


R1.1 


L! 

LICR 


R1.1 
CR1.1 


IF X THEN GO TO LAB; 


MTCR 
BC 


CR1.R1 
CR1 ,LAB 


BC 


CR1 ,LAB 


Y = X; 


ST 


R1.Y 


ST 


R1.Y 



30 If the Store into Y were not present, then the code in column (3) would simplify; that is, GPR R1 would 

become dead (have no uses), so the conventional optimization of dead code elimination would eliminate the 
two LI instructions. But the code in column (2) does not simplify in this case. 

While the invention has been described in terms of a single preferred embodiment, those skilled in the 
art will recognize that the invention may be practiced with modification within the spirit and scope of the 

35 appended claims. 

Claims 

40 1 In a compiler which generates compiled object code from source code of a computer program, a 
computer performed method of generating efficient object code for a computer with dissimilar register 
spaces comprising the steps of: 

generating by a computer code that references symbolic registers in which the register class is not 
distinguished (2); 

4s making entries in a table by the computer denoting the context in which each symbolic register occurs ana 
constructing an equivalence tree of symbolic registers for move instructions assigned to a same equiv- 
alence class (11); . 
for each equivalence class, forming by the computer the logical OR function of register usage informat.on 
for all symbolic registers in the class, and for each symbolic register that appears in more than one register 

so space context, generating by the computer new symbolic register numbers so that there is one number for 
each register space, and storing the numbers in said table (12); and 

if a definition point of a symbolic register is encountered and that symbolic register is used in more than 
one register space context, inserting by the computer code in said program to either do the same operation 
as is done at the definition point in each register space or move a value in the symbolic register from one 
55 space to another (13). ^ 

2 The method of generating efficient object code for a computer as recited in claim 1 wherein said step 
of making entries in a table (11) further comprises the step of checking by the computer each instruction to 
determine if it is a move register instruction (17), and if so, putting source and target registers in a same 
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equivalence class (1 8). 

3 The method of generating efficient object code for a computer as recited in claim 1 or 2 wherein said 
step of making entries in a table (11) further comprises the step of checking by the computer each 
instruction to determine if it is a conditional branch instruction (19), and if so, noting a conditional register 
5 context for that register (20). 

4. The method of generating efficient object code for a computer as recited in one of claims 1 to d 
wherein said step of making entries in a table (11) further comprises the step of checking by the computer 
each instruction to determine if it is a compare instruction (21), and if so, noting a conditional register 
context for that register (22). 

70 5. The method of generating efficient object code for a computer as recited in one of claims 1 to 4 
wherein the step of making entries in a table (11) further comprises the steps of: 

noting by the computer a floating point register context for all referenced floating point registers (23); and 
noting by the computer a general purpose register context for all referenced general purpose registers (24). 

6. The method of generating efficient object code for a computer as recited in one of claims 1 to 5 
75 wherein the steps of forming the logical OR function and generating new symbolic register numbers (12) 

further comprise the steps of: 

testing by the computer each register to determine if it is assigned to an equivalence class (unreferenced 
block left of 30), and if not, putting that register in a: class by itself (30) and 

testing by the computer each register to determine if it appears in more than one context (38), and if so, 
20 making up new names so that there is a different one for each context (39). 

7. The method of generating efficient object code for a computer as recited in one of claims 1 to 6 
wherein said step of inserting code (13) further comprises the step of checking by the computer each 
instruction to determine if it is a move register instruction (44), and if it is, inserting by the computer code to 
do the move instruction in that register space (45). 

25 8. The method of generating efficient object code for a computer as recited in one of claims 1 to 7 
wherein said step of inserting code (13) further comprises the step of checking by the computer each target 
register to determine if that register appears in more than one context (46), and if so, determining by the 
computer if the operation can be done efficiently in alternate register spaces (47), and if so, inserting by the 
computer code to do the operation in the alternate register spaces (48), but otherwise, inserting by the 

30 computer code to move the result to the alternate register spaces (49). 

9. The method of generating efficient object code for a computer as recited in one of claims 1 to 8 
wherein said step of inserting code (13) further comprises the steps of testing by the computer each source 
register to determine if it appears in more than one context (50), and if so, changing by the computer the 
register to a new name that is in the appropriate space for a corresponding operation (51). 
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© A compiler generates compiled object code from 
source code^f ^computer program in a manner that 
produces efficient object code for a computer with 
dissimilar register spaces. The technique comprising 
the steps of 

generating code that references symbolic registers 
in which the register class is not distinguished, 
making entries in a table denoting the context in 
which each symbolic register occurs and construct- 
ing an equivalence tree of symbolic registers for 
move instructions assigned to a same equivalence 
class, 

for each equivalence class, forming the logical OR 
function of register usage information for all sym- 
bolic registers in the class, and for each symbolic 
register that appears in more than one register 
space context, generating new symbolic register 
numbers so that there is one number for each regis- 
ter space, and storing the numbers in said table, and 
if a definition point of a symbolic register is encoun- 
tered and that symbolic register is used in more than 
one register space context, inserting code in said 
program to either do the same operation as is done 
at the definition point in each register space or move 
a value in the symbolic register from one space to 
another. The improvement achieved is in object 
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